Beyond kappa: A review of interrater agreement measures
Abstract
In 1960, Cohen introduced the kappa coefficient to measure chance-corrected nominal scale agreement between two raters. Since then, numerous extensions and generalizations of this interrater agreement measure have been proposed in the literature. This paper reviews and critiques various approaches to the study of interrater agreement, for which the relevant data comprise either nominal or ordinal categorical ratings from multiple raters. It presents a comprehensive compilation of the main statistical approaches to this problem, descriptions and characterizations of the underlying models, and discussions of related statistical methodologies for estimation and confidence-interval construction. The emphasis is on various practical scenarios and designs that underlie the development of these measures, and the interrelationships between them.
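As an illustration of the chance-corrected agreement that the abstract attributes to Cohen (1960), the two-rater kappa is commonly written as kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement and p_e is the agreement expected by chance from each rater's marginal category frequencies. The following is a minimal sketch under that standard definition, not code from the reviewed paper; the function name `cohen_kappa` and the sample ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters assigning nominal categories to the same items.

    Hypothetical helper for illustration; assumes both sequences are aligned
    item by item and contain hashable category labels.
    """
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("both raters must rate the same non-empty set of items")

    n = len(ratings_a)

    # Observed agreement: share of items on which the two raters agree.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: sum over categories of the product of marginal proportions.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    p_e = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    if p_e == 1.0:
        # Both raters used a single identical category; kappa is undefined.
        raise ValueError("kappa is undefined when chance agreement equals 1")

    return (p_o - p_e) / (1 - p_e)


# Made-up ratings, for illustration only.
rater_1 = ["yes", "yes", "no", "no"]
rater_2 = ["yes", "no", "no", "no"]
print(cohen_kappa(rater_1, rater_2))
```

With these made-up ratings the raters agree on three of four items (p_o = 0.75) and chance agreement is p_e = 0.5, so kappa is 0.5. The extensions the review covers (multiple raters, ordinal categories) change how p_o and p_e are defined and are not handled by this sketch.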
Similar resources
Data Quality and Medical Record Abstraction in the Veterans Health Administration's External Peer Review
or Outlier Report 2001 3rd Quarter QIC Outlier Report: Key Performance Indicators for QIC 199 variable level description Chg from history Chg from previous 2001 3Q 2001 2Q 2001 1Q 2001 0Q BL HXASCVD 1 Yes extreme increase significant increase 26% 17% 0.4% 4.3% 2 No extreme decrease significant decrease 74% 83% 100% 96% TOBSTATUS 1 Current user increased increased 19% 19% 18% 18% 2 Former user i...
Comparison of in-person and digital photograph assessment of stage III and IV pressure ulcers among veterans with spinal cord injuries.
Digital photographs are often used in treatment monitoring for home care of less advanced pressure ulcers. We investigated assessment agreement when stage III and IV pressure ulcers in individuals with spinal cord injury were evaluated in person and with the use of digital photographs. Two wound-care nurses assessed 31 wounds among 15 participants. One nurse assessed all wounds in person, while...
Interrater reliability: the kappa statistic
The kappa statistic is frequently used to test interrater reliability. The importance of rater reliability lies in the fact that it represents the extent to which the data collected in the study are correct representations of the variables measured. Measurement of the extent to which data collectors (raters) assign the same score to the same variable is called interrater reliability. While ther...
Validation of a grading system for lateral nasal wall insufficiency
This study was designed to validate a grading scheme for lateral nasal wall insufficiency with interrater and intrarater reliability measures. Representative endoscopic videos depicting varied degrees of lateral nasal wall insufficiency were collated into a 30-clip video (15 clips in duplicate). This was rated by five reviewers for a total of 150 observations. Interrater and intrarater reliabil...
Publication date: 2008